beam-search implementation for more exhausting sampling #35

laurcate · 2023-03-31T08:46:28Z

Issue #, if available:

Description of changes:
This function largely replaces A2RL's Simulator.gpt_sample_n_steps(). It is not
concerned with states, actions, or rewards; it only generates the next N tokens using beam search.
It is intended for use by a downstream BYO planner as a replacement for the normal sampling strategy.
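
For readers unfamiliar with the technique, here is a minimal sketch of beam search over next-token log-probabilities. It is not the code in this PR: the name beam_search_n_steps, the log_prob_fn callback, the vocab_size parameter, and the toy model are all hypothetical, and the beam_width check is only an assumption about what a "beam width too large" error might look like.

```python
import math
from typing import Callable, List, Sequence, Tuple

# Hypothetical callback: maps a token sequence to one log-probability
# per vocabulary entry for the next token.
LogProbFn = Callable[[Sequence[int]], Sequence[float]]


def beam_search_n_steps(
    log_prob_fn: LogProbFn,
    context: Sequence[int],
    n_steps: int,
    beam_width: int,
    vocab_size: int,
) -> List[Tuple[List[int], float]]:
    """Extend `context` by `n_steps` tokens, keeping the `beam_width` best sequences.

    Returns (tokens, cumulative log-probability) pairs, best first.
    """
    # Assumed validation: a beam wider than the vocabulary cannot be filled
    # after the first step, so reject it up front.
    if beam_width < 1 or beam_width > vocab_size:
        raise ValueError(f"beam_width must be in [1, {vocab_size}], got {beam_width}")

    beams: List[Tuple[List[int], float]] = [(list(context), 0.0)]
    for _ in range(n_steps):
        candidates: List[Tuple[List[int], float]] = []
        for tokens, score in beams:
            log_probs = log_prob_fn(tokens)
            # Expand every surviving beam by every possible next token.
            for tok in range(vocab_size):
                candidates.append((tokens + [tok], score + log_probs[tok]))
        # Keep only the highest-scoring sequences for the next step.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_width]
    return beams


def toy_log_probs(tokens: Sequence[int]) -> List[float]:
    """Toy stand-in for a model: a fixed distribution, ignoring the context."""
    return [math.log(p) for p in (0.5, 0.3, 0.2)]
```

Calling beam_search_n_steps(toy_log_probs, [0], n_steps=2, beam_width=2, vocab_size=3) returns two candidate sequences, each the original context plus two generated tokens. Unlike plain sampling, the result is deterministic for a given model, which is why a planner might prefer it when it needs the most likely continuations rather than random ones.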

Additionally, a notebook with an example implementation has been added.

Testing done:
Yes

Merge Checklist

Put an x in the boxes that apply. You can also fill these out after creating the PR. If you're
unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of
what we are going to look for before merging your pull request.

[x] I have read the CONTRIBUTING document.
I have added tests that prove my fix is effective or that my feature works (if appropriate).
I have updated any necessary documentation, including README and docs (if appropriate).

By submitting this pull request, I confirm that my contribution is made under the terms of the
Apache 2.0 license.

github-actions · 2023-05-04T02:48:03Z

Coverage Report

File	Stmts	Miss	Cover	Missing
src/a2rl
__init__.py	21	1	95%	35
_io.py	68	1	99%	353
simulator.py	619	61	90%	456–458, 471, 537, 543, 558–590, 596, 908, 1007, 1022, 1032, 1037, 1053, 1074, 1096, 1122, 1143, 1149, 1166, 1170, 1176, 1186, 1193, 1216, 1222, 1293, 1306–1309, 1320–1321, 1344, 1403, 1410, 1465, 1472, 1478, 1485, 1488, 1599, 1615–1617, 1626, 1650, 1689, 1692
tokenizer.py	116	2	98%	64–65
utils.py	161	22	86%	51, 60–63, 74–76, 108, 147, 164, 179–181, 336–339, 475–484, 544, 558
src/a2rl/experimental/lightgpt
lr_decay.py	20	1	95%	30
model.py	116	2	98%	260, 263
simulator.py	35	1	97%	162
src/a2rl/mingpt
model.py	118	4	97%	57, 203, 208, 230
trainer.py	84	14	83%	50–51, 55–57, 108–111, 116, 124–126, 134, 140–141
TOTAL	1592	109	93%

Tests	Skipped	Failures	Errors	Time
240	0 💤	4 ❌	0 🔥	27.600s ⏱️

verdimrc · 2023-05-04T04:00:09Z

Thank you @laurcate, @patrick22414. Feel free to merge to main.

Ignore the linter error -- looks like a newer Black wants to reformat even files outside this PR.

I recommend that you review the missing coverage in your new function (see the coverage report a few comments above). I'm okay with you adding extra tests for the edge cases in a separate PR.

verdimrc · 2023-05-13T02:45:56Z

Thank you for the additional test cases.

beam-search pr

68e6675

verdimrc reviewed May 4, 2023

requirements.txt (outdated, resolved)

Update requirements.txt

b6371dd

Hopefully this fixes failing tests

Verdi March added 3 commits May 4, 2023 11:03

Bump typeguard to minimum version 3.0.0

4334747

Support pandas>=1.5.0 (which deprecates df.iteritems() and sometime later completely drop it)

bc91e90

Improve docstrings

20cdf3f

verdimrc reviewed May 4, 2023

src/a2rl/simulator.py (outdated, resolved)

verdimrc approved these changes May 4, 2023

more tests & error for beam_width too large

d5a4366

verdimrc merged commit d0dab0f into main May 13, 2023

verdimrc deleted the feature/beam-search branch May 13, 2023 02:46

verdimrc mentioned this pull request May 16, 2023

Release 1.2.0 #36

Merged

3 tasks
